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The CONS microprocessor is a general purpose processor designed for convenient 
emulation of complex order codes, particularly those involving stacks and pointer 
manipulation. It is the central processor in the lisp machine project, where it 
interprets the bit-efficient 16 bit order code produced by the lisp compiler. The 
data paths of the machine are 32 bits wide. Each 42 bit wide micro-code instruction 
specifies two 32 bit data sources from a variety of internal scratchpad registers, 
and the two data-mainipulation instructions specify a destination address. 

The internal scratchpads include a IK pointer addressable RAM intended for storing 
the top of the emulated stack, in a manner similar to a cache. Since a large 
percentage of main memory references are to the stack, this should materially speed 
up the machine. 

The machine has a 12 bit program counter, which behaves much like that of a. 
traditional processor, allowing up to 4K of writeable microprogram memory. The 
control portion also includes a 32 location micro subroutine return stack. 

Memory is accessed through a two level virtual paging system, which maps 23 bit 
virtual addresses into a 20 bit physical address. 

There are four micro-instruction classes defined. Each specifies sources for the A 
and N busses, and optionally a destination. The four operations are: 

1. an ALU operation which performs adds, subtracts, and boolean operations 

2. a byte operation which performs byte extraction and deposit, as well as selective 
field substitution 

3. a conditional transfer instruction, conditional on the value of any bit accessible 
to the N bus, or the carry and equal flags of previous ALU operations 

4. a dispatch, which allows field extraction, masking, and dispatching to assigned 
locations depending upon the value in the resulting field 

There are several sources and destinations whose loading and use invoke special 
action by the microprocessor. These include the memory address and memory data 
registers, whose use initiates main memory cycles. 

Two of the ALU operations are conditionally of two forms, depending upon the low 
order bit in the Q register and the sign of the previous ALU result. These 
operations are used for MUS and DIS (multiply step and divide step). 

The main features of this machine which make it suitable for interpreting the lisp 
machine order code are its writable microcode, its very flexible dispatching and 
subroutining, its excellent byte manipulation abilities, and its internal stack 
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storage. A conscious attempt has been made to avoid features that ;™* Pe gS 
purpose. The goal is a machine that happens to be good at interpreting this 
particular order code. Hopefully, it can interpret others almost as well. 

Since the use or the term "micro" in refering to registers and instructions becomm 
redundant, its use will be dropped from here on in the manual. All instructions 
discussed are microinstructions. 
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The control section of the processor consists of a 12 bit program counter (the PC), a 
32 location PC stack (SPC), and a IK dispatch memory, used during the dispatch 
Instruction. Unlike some micro-processors, and like most traditional machines, the 
normal mode of operation is to execute the next sequential instruction. 

The processor uses single instruction look ahead, i.e. the lookup of the next 
instruction is overlapped with execution of the current one. This implies that 
transfer and dispatch instructions normally execute the following instruction, even 
if the branch was successful. Provision is made in these instructions to inhibit 
this execution (with the N bit), but the cycle it would have used will then be 
wasted. 



12 is a branch instruction to location of 110 



TIME ««> 



I I 

| fetch II | fetch 12 
I execute 10 j execute II 
I I 

I 
Fetch of branch --| 

Execute of branch 



Execute of (optionally) 
inhibited instruction -- 



I fetch 13 
execute 12 



| fetch 110 | fetch 111 
I execute 13 j execute 110 
I I 



Execute of instruction branched to 



Two op codes affect flow of control in the machine. The conditional jump specifies a 
new PC and transfer type in the jump instruction, while the dispatch instruction 
looks up the new PC and transfer type in the dispatch table (DPC). In either case,, 
the new PC is loaded into the PC register, and the operation specified by the three 
bit transfer type is performed. These operations are: 

N bit - If on, inhibits execution of the (physically) next instruction. 
The cycle that would have executed that instruction is wasted. 

P and R bits are decoded as follows: 
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Effect 



JUMP (no return saved) 

CALL (save PC+2 on the SPC stack) 
[PC+1 if the N bit is also on] 

1 RETURN (ignore new PC, pop a PC off the SPC and load it 
into the PC register) 

1 in JUHPS: 

WRITE (write contents of specified A and H scratchpads into 
to the microcode memory at the address of the PC specified 
in the instruction) 

in DISPATCHES: 

FALL through to next instruction (don't dispatch) 



The JUMP transfer type is the normal program transfer, without saving a return 
address. 

The CALL transfer type pushes the current PC, plus two, [plus 1 if the N bit is also 
on] onto the SPC stack. This stack is 32 locations long. It is the responsibility 
of the programmer to avoid overflows. 

The RETURN transfer type pops a return PC from the SPC stack and uses it in place of 
the PC specified in the instruction or dispatch table. 

The WRITE transfer type is the mechanism for writing instructions into the 
microprogram instruction store. The reason for its odd location in the instruction 
set is due to the way in which it operates. It causes the same operations as the 
CALL transfer type, resulting in the PC register being loaded with the address to be 
modified. Then, when the instruction RAM would normally be fetching the instruction 
to be executed from that location, a write pulse is generated, causing the data 
fetched from the A and M scratchpads to be written into the memory. It is required 
that the data be fetched from the scratchpads, and not some other source, including 
any of the pass around paths that might be in use. Meanwhile, the machine simulates 
a RETURN transfer instruction, causing instruction execution to proceed from where it 
left off. Note that this instruction requires use of a word on the SPC stack and 
requires an extra cycle. It is highly recommended that the N bit also be on during 
this instruction, since the processor will be executing a POP transfer type 
unconditionally during what should be the execution of the intruction following the 
write. If this does not conflict with other things that this following instruction 
specifies, then it may be executed. Care is required. 

An additional bit in each instruction (the POP bit) allows specification of 
simultaneous execution of a POP transfer type along with execution of any 
instruction. (I.e. it does the same thing as if this instruction, in addition to 
whatever else it does, executes a POP transfer type jump without the N bit on) It is 
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the responsibility of the programmer to avoid obvious conflicts in the use of this 
bit simultaneously with other types of transfers. 
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1.1 Dispatching 



The dispatch Instruction allows selection of any source available on the H 
multiplexor [see description of N bus sources in the Data Path section], and the 
dispatch on any sub-field of up to 7 bits from the selected word. The selected 
subfleld is ORed with the "dispatch offset* field of the instruction to produce a 10 
bit address. This address is used to look up a 12 bit PC and 3 bit transfer type in 
the dispatch ram. 



(DISPATCH (M-source (size bit-pos)) offset) 




14 '2 10 8 5 
I 2 | 2 | 2 | 3 | 
l_J_J I. 



Dispatch - — 

constant 
H source 



Dispatch Offset 
not used— — - 
Hlsc Function- 



not used--- --————- — 

# of bits to extract from N source- 
Pi rotate----—————— 



Dispatch RAH 



1141312 
I1UUI 
LLLL 
I I 
R bit (pop the SPC into the PC) ignores bits 0-11 — | I 



P bit (push the PC onto SPC) 

N bit (inhibit execution of following instruction)- 
new PC -— — — - ......... 



12 
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The dispatch constant field is loaded into the dispatch constant register (acessible 
from the N multiplexer) on every dispatch instruction. 

The N rotate field contains the rotate necessary to move the field being dispatched 
on into the low order bits of the word. The rotate is to the left. 

The miscellaneous functions are: 

Function 2: Load the dispatch ram. Inhibits the normal action of the instruction 

and instead loads the dispatch ram with the low order contents of the M memory 
scratchpad location specified in the source. The parity (bit 15) is also loaded, and 
it is the responsibility of the programmer to load correct parity into the ram. 
Normal addressing of the RAM is in effect, so you probably want the # of bits field 
equal' to zero to uniquely address a particular location. 

Function 3: Force the high order bit of the rotate field equal to one if the low 
order bit of the emulated PC is one. 
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1.2 Conditionals 



The branch conditions are of two major types. First, it is possible to test the 
state of any bit accessible to the M multiplexor by specifying the source and a shift 
which will leave the tested bit in the low order bit position of the output bus. 
This allows testing of all the flag bits, since they are accessible from the N 
multiplexor. The second type of conditional is the arithmetic conditional, where two 
operands are specified, and an ALU subtract is performed, resulting in tests of the 
carry, zero, and overflow bits in the ALU. This is useful for comparing two numbers. 
Other miscellaneous conditionals test the validity of the currently loaded main 
memory address with the memory map, and test for external interrupt conditions. 



Opcode 1 JUMP 



I 3938 
I 2 |1| 

I IJ. 

I I 

Op I 

I 

Pop--| 



30 
I 
.1. 



(IF (condition A-source M-source) (opcode destination)) 

.1 I I I I I I I 

24 12 10 9 8 7 6 5 

6 1 12 • 12 1111111 1|1| 5 



A source 

H source 

new PC 

Hisc Function 

R bit (pop SPC into PC) ignore new PC 

P bit (push PC onto SPC) 

N bit (inhibit execution of following instructions- 
Invert Sense of test if 1-- 



12 10 9 8 7 6 5 
I 2 1 1 1 1 1 1 1 1 1 1 1 

I LLIJJJ. 

I I 
I I 
I I 
I I 
I I 
I I 
i I 
I I 
I I 
I I 
I I 
I I 
I I 
I I 
I I 
I I 
I I 
-I I 



Test Bit (0) or Test Condition (1) 

H Rotate (if Test Bit) or Condition number - 



The tested conditions are: 

illegal 1 H<A H source less than A source 2 
less than or equal to A source 3 M=A A and N 
interrupt 5 pager fault 6 pager fault or interrupt 



HJA n source 
sources are equal 4 
7 always true 



The shift field contains the number of bits necessary to rotate the M source left 
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such that the tested bit is in bit position zero. That is, to test the sign bit, it 
should contain a 1. 

The miscellaneous functions are: 

Function 3: Force the high order bit of the shift field on if the low order bit 
of the emulated PC is a one. 
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2.0 Data paths 



The data paths of the machine consist of two source busses, which provide data to the 
ALU and Byte extractor, and an output bus which is selected from the ALU (optionally 
shifted left or right) or the output of the Byte extracter. We first describe the 
specification of the source busses, which are identically loaded for all instruction, 
then the destination specifiers which control where the data is stored, and finally, 
the two operations for controlling the ALU and the Byte extracter. 
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All instructions specify sources in the same way. There are two source busses in the 
machine, the A bus and the H bus. The A bus is driven only from the A scratchpad 
memory of 256 locations. The M bus is driven from the M scratchpad of 32 locations, 
or from up to six other sources. Among these sources are the main memory data, the 
PC stack (for restoring the state of the processor after traps), the internal stack 
buffer, a word of processor flag bits, and the Q register. Addresses for the A and H 
scratchpads are taken directly from the instruction. The alternate sources of data 
for the H source are specified with an additional bit in the H source field. 

lR<37-30> ■ A source address 

IR<29-24> * N source address 
If IR<29> » 

IR<28-24> a n scratchpad address 
If IR<29> > 1 

IR<28-24> 



1 

2 

12 

22 

32 

3 

4 

5 



» M multiplexor source 

- M scratchpad (illegal) 

- M scratchpad pass around path (illegal) 

- Main memory data 

- Main memory write data 

- VMA 

- Memory map data 

- Q register 

- PDL INDEX <9-0> PDL POINTER <19-10> 

- SPC <17-0>, SPC pointer <31-27> 

15 - as above, but also causes a pop of the SPC pointer 

6 - Dispatch constant <7-0> 

7 - PDL (pointer) POP 

17 - PDL (pointer) no POP 
27 - PDL (index) 
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2.2 Destinations 



The 10 bit destination field in the Byte and ALU instructions specifies where the 
result of the instruction is deposited. It is in one of two forms, depending upon 
the high order bit. The high order bit on indicates that the low order 8 bits are an 
address of an A memory location. If the high order bit is a zero, the remaining 9 
bit field is divided into two fields, a 4 bit register select field, and a 5 bit H 
scratchpad address. Both of the registers specified by these fields get written. 

IR<23-14> > destination 
If IR<23> » 1 

IR<21-14> ■ A scratchpad write address 
If IR<23> « 

IR< 22-19) z Register write address 
- none 
- Memory data 
Memory data (write now) 



1 
2 
3 
4 
5 
6 
7 
10 
11 



VMA 

VMA (read) 

VMA (write now) 

VMA (write map) 



PDL (pointer), push 

12 - PDL (index) 

13 - PDL index 

14 - PDL pointer 

15 - SPC (causes a push) 

16 - next instruction modifier, bits <44-24> 

17 - next instruction modifier, bits <23-0> 
IR<18-14> » M scratch pad write address 



Note: The Q-register is loaded using the ALU instruction. 

The conditional branch and dispatch instructions have no destination field. 
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The ALU operation performs most of the arithmetic in the machine. It specifies two 
sources of 32 bit numbers, and an operation to be performed by the ALU. The 
operation can be any of the 16 booleans, two's complement add, subtract (in one 
direction only), left shift, and several less useful operations. The carry into the 
ALU can be forced to a one or zero. Additionally, the ALU op specifies one of four 
operations upon the Q register. These are do nothing, shift left, shift right, and 
load from the output bus. An additional bit in the ALU operation field is decoded to 
indicate variable operations, and the operation performed with this bit set is 
determined partially by the low order bit in the Q register. This is how the MUS and 
DIS instructions are specified for bitwise multiplication and division. 

IR<40-39> = 

IR<38> = POP transfer 

IR<37-30> - A memory source 

IR<29-24> » M memory source and M mux control 

IR<23-14> s Destination 

IR<13-12> » output bus control 

- masker output (illegal) 

1 - ALU output 

2 - ALU output shifted right one (sign shifted in) 

3 - ALU shifted left one (high order bit of Q shifted in) 

IR<11-10> a Mlsc Function 

- 

1 - 

2 - 

3 - Load low order bit of emulated PC from ALU low order bit output 

IR<9-4> » ALU op 

If IR<9> = 

IR<8-4> « ALU op code 
If IR<9> a 1 

IR<8-4> a Conditional ALU op code 

a> multiply step 

1 ■> divide step 
IR<3-2> ■ Carry code 

- carry zero 

1 - carry one 

IR<l-0> a Q control 

- do nothing 

1 - shift Q left (ALU sign output shifted in) 

2 - shift Q right (ALU low order bit output shifted in) 

3 - load Q from output bus 

3 - Masker output (not particularly useful) 
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(opcode (A-source B-source) destination ) 




A source — 

R source and mux control 

destination--- 

output bus control—— 

Hisc Function 

ALU opcode 

Carry code -- 

Q control 



ALU bit operation functions (from Table 1 of 74181 specifications) 
(number in parentheses after arithmetic opcodes is the low order carry in) 
(all arithmetic operations are two's complement) 



boolean 







n 




1 


(A 


H) 


2 


( K) 


A 


3 







4 


(K 


A) 


5 


A 




6 


n xor 


A 


7 


H ( A) 


10 


( n) 


A 


11 


H eqv 


A 


12 


A 




13 


H A 




14 


1 




15 


n ( A) 


16 


n a 




17 n 





arithmetic 



H - A - 1 (carry clear) 



H ♦ A 

(H A) - 1 (carry clear) 
H ♦ H (carry clear) 



M - 1 (carry clear) 



H ♦ 1 (carry set) 

H - A (carry set) 
PI ♦ A ♦ 1 (carry set) 
H ♦ H + 1 (carry set) 
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The byte operation specifies two sources and a destination in the same way as the ALU 
operation, but the operation performed is one of selective insertion of a byte field 
of the M source into an equal length field in the A source. The rotation of the H 
source is specified by the ZR bit as either zero or equal to the contents of the 
ROTATE field. The rotation of the mask used to select the bits replaced is specified 
by the ZH bit as either zero or equal to the contents of the ROTATE field. The 
length of the mask field used for replacement is specified in the LENGTH field. The 
four states of the ZR and ZH bits yield the following operations: 

2R=0 ZM=0 : not useful (subset of other modes) 

ZR»0 ZM«1 : PDP-10 LDB instruction (except the unmasked bits are from the A source) 

ZR«1 ZPI»0 : Selective deposit of the masked field from one word into the same 
length and position byte in the second word. 

ZR»1 ZH=1 : PDP-10 DPB instruction 

Byte operations automatically assert output bus mux source from the masker output. 

IR<40-39> = 3 

IR<38> = POP transfer 

IR<37-30> s A memory source 

IR<29-24> = M memory source and H mux control 

IR<23-14> a Destination 

IR<13> = ZM 

IR<12> » ZR 

1R<11-10> » Misc Function 

3 - force high order bit of the ROTATE field to a one if the low order bit 
bit of the emulated PC is a one. 
IR<9-5> » Length of mask byte - 1 (i.e. zero «> one bit) 
IR<4-0> * Rotation of mask or H source 
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Opcode 3 BYTE (BYTE (size (A-source bit-pos) (M-source bit-pos)) destination ) 



| 3938 
I 2 |1| 

I IJ. 

I I 

Op I 

I 

POP— | 



A source-- 

N source and mux control- 
destination 

ZR.ZN 

Hisc Function — 

Length —————— 

Rotate- 




1 


1 1 




1 


1 


14 12 10 

1 2 | 2 | 

J 1 1- 


5 


9 
1 
1 


5 



1 
1 
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A novel technique is used for variabilizing fields in the program instruction. Two 
of the "destinations" of the output bus are (conceptual) registers, whose contents 
get ORed with the next instruction executed. Combined with' the shifter/masker 
ability to move any contiguous set of bits into an arbitrary field, this feature 
provides, for example, variable rotates and the ability to use program determined 
addresses of registers. The micro-instruction is divided between bits 23 and 24, 
which is a natural dividing line for all microinstructions. 



DRAFT 



Trie CONS flicro Processor aflJsn* 
4.0 Clocks 



August 28, 1975 23:29 Clocks-2B 



This processor uses only one clock, occuring at the end of every cycle. This clock 
loads output data Into the designated registers, and a new PC and instruction is 
latched. The only events which do not take place synchronous with the clock are the 
control signals for the A and M scratchpads in the processor and the PC stack. For 
these devices, a two stage cycle is performed. During the first phase, the source 
addresses of the respective devices are gated into the address registers. After the 
output data has settled, the outputs of these devices are latched. Then, the address 
is changed to that specified as the write location from the previous*- instruction. 
After the address has settled, a write pulse is generated for the scratchpad memory 
to perform the write. A pass-around path is provided (invisibly to the programmer) 
which notices and corrects read references to a location which was written into on 
the previous cycle, but not yet actually written into the scratchpad. 

Timing of scratchpad references 

Instruction Instruction 1 

I fetch for 10 store for 1-1 I fetch for II store for 10 | 



I 



time »«»> 
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Output Bus 
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